Computerized system and method for teaching and assessing the holistic scoring of open-ended questions

ABSTRACT

A tutorial method for teaching the scoring of open-ended questions holistically includes displaying a student response to a scorer and permitting the scorer to access a rubric containing the rules for scoring that response. The scorer can choose a display form from a handwritten form and a typed text form that retains and originally present errors. Following the scorer&#39;s having entered a score, a model score is displayed so that a scoring efficacy may be determined. Annotations prepared by expert scorers may be accessed to enhance the learning process. In addition, a running correlation between the model and entered scores is calculated and displayed for the scorer over a tutorial session that includes attempts at scoring different responses. The system includes a processor, a workstation, and software for performing the above-described method.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to systems and methods for teaching scoring and for assessing scoring effectiveness and, more particularly, to such systems and methods for teaching and assessing holistic scoring.

2. Description of Related Art

The automation of test scoring is a complex problem that has generated a great deal of interest, owing to a significant economic pressure to optimize efficiency and accuracy and to minimize human involvement. Open-ended or essay-type questions must typically be scored by a human reader, and thus either the physical test form or a visible image thereof must be available for at least the time required for scoring. In addition, scorers (also referred to as readers or resolvers) must be trained in order to become accomplished in analyzing and scoring the answers to open-ended questions effectively, accurately, and quickly.

Computerized systems for scoring open-ended questions are known in the art. In addition, such systems are known that provide feedback to a scorer on validity, reliability, and speed based upon a standard question and model answer. For example, Clark and Clark et al. (U.S. Pat. Nos. 5,321,611; 5,433,615; 5,437,554; 5,458,493; 5,466,159; and 5,558,521) disclose systems and methods for collaborative scoring, wherein scores of two or more resolvers are compared, and a record is kept of each of the resolver's scores. This group of patents also teach the collection of feedback on a resolver, which includes the monitoring of scoring validity, reliability, and speed. One of the criteria is a calculation of a deviation of the resolver's score and a model score by using “quality items.” Also discussed is an online scoring guide for use by the resolver during scoring.

However, there are no systems and methods known in the art that are specifically directed to the teaching of scoring open-ended questions and to providing scoring rules; model answers, scores, and rationales therefor; and feedback to a resolver during the teaching process.

SUMMARY OF THE INVENTION

It is therefore an object of the present invention to provide a system and method for assessing a scorer's grading of an open-ended question.

It is an additional object to provide such a system and method for interactively assisting a scorer in learning a holistic scoring technique.

It is another object to provide such a system and method for tracking a scorer's progress during a practice session.

It is a further object to provide such a system and method for practicing holistic scoring in a variety of content domains such as, but not limited to, reading, writing, science, and mathematics in the same program.

It is yet another object to provide such a system and method for presenting a set of holistic scoring rules, or rubric, to the scorer.

These and other objects are achieved by the system and method of the present invention. One aspect of the method is for teaching a scorer holistically to score an answer to an open-ended question. Holistic scoring is a technique whereby a unitary, typically numerical, score is given for an answer to an open-ended question; for example, in an essay-type response, spelling and grammatical errors and content are all taken into account when granting the score.

The teaching method comprises the step of displaying a student response, which in a particular embodiment may be chosen by the scorer to be presented in handwritten or typed text form, to an open-ended question to a scorer. In a preferred embodiment the scorer is permitted to access for display a scoring rubric for the question, which comprises a set of rules on which the scoring for that question should be based. The scorer then assesses the response and enters a score for the response, which is received by the system. Finally, a model score is presented to the scorer. A comparison of the model score with the scorer's entered score permits him or her to assess his or her scoring efficacy, that is, how close the entered score came to the model score prescribed for the response.

The tutorial software program of the present invention, which may also be referred to simply as a tutorial, in a preferred embodiment comprises a plurality of databases, or, alternatively, a plurality of sectors in a unitary database, containing:

1. A plurality of student responses to an open-ended question. Preferably, each student response is present in an original handwritten image form and in a text form. The text form retains all original errors from the handwritten image.

2. A model score for each student response.

3. A scoring rubric for each question.

4. An analysis of each student response and a rationale for the model score for each student response.

The teaching system of the present invention comprises a computer, or data-processing system, such as, for example, a personal computer or workstation. The computer has resident therein, or has means for communicating with a storage device having resident thereon, a database as described above.

The system also comprises means for displaying a student response to a question to a scorer, means for permitting the scorer to access the scoring rubric for the question, means for receiving a score from the scorer. As described above, these means typically include a personal computer or networked workstation having a keyboard, screen, pointing device, and communication means for accessing a storage device.

Software means are also resident in the computer for presenting on the display means a model score to the scorer to permit the scorer to assess his or her scoring efficacy, that is, how close the assigned score is to the model score. The software means also comprises means for displaying an explanation or annotation of the model score assigned. In addition, means are provided within the processor for tracking the scorer's progress during a practice session with the tutorial. This tracking is preferably accomplished by calculating a running correlation between the model answer and the score entered for each response.

The invention contemplates a system and method for teaching a scorer within a chosen level and discipline. For example, a particular tutorial may comprise a set of questions keyed to a grade level in a particular subject area (e.g., grade 7, history) or in related areas (e.g., grade 8, reading and writing, wherein reading competency is assessed by a student's response to a question on a reading selection, and writing competency is assessed by the student's response to an essay-type question). Alternatively, a set of responses to questions may address the subject matter contained in a professional licensing or qualification examination (e.g., for a laboratory technician).

The features that characterize the invention, both as to organization and method of operation, together with further objects and advantages thereof, will be better understood from the following description used in conjunction with the accompanying drawing. It is to be expressly understood that the drawing is for the purpose of illustration and description and is not intended as a definition of the limits of the invention. These and other objects attained, and advantages offered, by the present invention will become more fully apparent as the description that now follows is read in conjunction with the accompanying drawing.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a logic flowchart for the method of the present invention for teaching a scorer in a holistic scoring technique.

FIG. 2 is a schematic diagram of the system of the present invention.

FIG. 3 illustrates an exemplary opening menu for the tutorial program.

FIGS. 4(A)-4(D) illustrates a series of exemplary answers to a question on a reading selection, representing (A) a high reading; (B) medium reading; and (C) low reading models for a Grade 8 student. (D) A typed text version of the low reading model of (C).

FIGS. 5(A)-5(D) illustrates the first pages of a series of exemplary essays on a prescribed topic, representing (A) a high writing; (B) medium writing; and (C) low writing models for a Grade 8 student. (D) A typed text version of the low writing model of (C).

FIG. 6 represents an exemplary screen displaying a scoring rubric for reading.

FIG. 7 illustrates a model analysis of a response.

FIG. 8 illustrates a cumulative summary table of a scorer's performance.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

A description of the preferred embodiments of the present invention will now be presented with reference to FIGS. 1-8.

In a preferred embodiment of the system 60 of the invention, illustrated in FIG. 2, a person 20 desirous of receiving instruction in holistic scoring is provided with access to a processor. The means or access may comprise a personal computer or a workstation 61 or terminal networked to a server computer 62, or an interface to a remote site through telecommunications, internet, or other forms of data transmission, although these architectures are not intended to be limiting. The processor 62 has loaded thereon the tutorial software program 10 of the present invention, which will be described in the following. The computer access/interface is preferably provided by means well known in the art, e.g., via a display screen 64, keyboard 66, and pointing device 68 such as a mouse, for use in a Windows®-type or Macintosh environment.

A first aspect of the method (FIG. 1) is for teaching a scorer to holistically score an answer or response to an open-ended question (also referred to as an “assessment” or “assessment form” in the art) via a computer-driven tutorial program 10. The method comprises the steps of starting the tutorial program 10 (Step 99) and providing a choice to the scorer 20 of which section or module of the program 10 to enter (Step 100). In a preferred embodiment this choice is presented in the form of a screen-displayed menu (30, FIG. 3) in a Windows®- or Macintosh-type format. This is not intended to be limiting, however, as those skilled in the art will recognize alternate platforms and modes of providing such a selection. In this particular embodiment, two major divisions include introductory (choices 1-3, 211-213) and scoring practice (choices 4 and 5, 214,215) sections.

A first choice from the menu 30 comprises an overview (211, Step 101) of the tutorial 10, which introduces the scorer 20 to basic principles of integrated performance assessment and holistic scoring. The rationale underlying the development of this form of assessment and a general introduction to holistic scoring are offered.

A second choice from the menu 30 comprises a description of a particular assessment (212, Step 102), including its content, how to prepare for scoring responses, such as reading and writing responses to that assessment, and how to apply the rubrics.

A third choice from the menu 30 comprises a guided tour of the scoring section (213, Step 103). This section provides an annotated screen-by-screen preview of the scoring training program.

The practice division begins with a fourth choice from the menu 30, a review of model papers, rubrics, and annotations (214, Step 104). This section allows the scorer 20 to try out the training program's features before entering the actual scoring module (215, Step 105). Here the scorer can explore the rubrics for selected parameters such as, but not limited to, reading, rhetorical effectiveness, and conventions. The scorer 20 can view model student answers to illustrate, for example, high, medium, and low levels of performance. Exemplary responses are presented in FIGS. 4A-C, which represent high, medium, and low model written responses, respectively, to two questions on a reading selection, and FIGS. 5A-C, which represent the first pages of high, medium, and low model written essays on a prescribed topic. Note that in the case of FIGS. 5A-C, a dual score is given, one for “rhetorical effectiveness” and one for “conventions.” In addition, the scorer 20 can read annotations that analyze the answer and explain the scores assigned (FIG. 7).

The final selection offered on the menu 30 comprises the scoring practice module (215, Step 105), in which the scorer 20 can apply what has been learned in the preceding modules 211-214. A plurality of practice answers are provided for each assessment, preferably representing the gamut of “poor” to “excellent” responses.

In the scoring practice module 215 a first student response to an open-ended question is retrieved from a database 250 of student responses and is displayed to the scorer 20 (Step 106). (Here the word student is not intended to be limiting, but should be taken in the broad context of any person taking a test, which could include, for example, a person taking a licensing examination or professional or technical evaluation test.) This step 106 in a preferred embodiment further comprises providing a means for the scorer 20 to select a display mode (Step 107). The display mode can be one of an original handwritten form (Step 108) or a typed (or “keyboarded”) text form (Step 109), wherein the typed text form retains all errors in the original handwritten form, such as spelling, grammatical, syntactical, and punctuation mistakes (see, for example, FIGS. 4C,D and 5C,D, which represent the handwritten and typed text versions of the same responses).

The scorer 20 is permitted at any time during scoring to access a scoring rubric 220 for the question from a scoring rubric database 251 (FIG. 6, Steps 110,111). Each rubric contains an indication of what a numerical score 222 represents, including both a brief descriptor (e.g., “exemplary reading performance” 224) and an extensive description of each score point 226 (see FIG. 6). This scoring rubric is typically accessed by the scorer 20 via selecting an icon on the screen 64 with the pointing device 68, although this method is not intended to be limiting.

Once the scorer 20 has reviewed the response (Step 112), a score is entered (Step 113), for example, by selecting a number from a button bar 642 on the screen 64 with the pointing device 68. Such methods of selecting from a variety of options is well known in the art, and other, comparable selection methods may also be envisioned, such as entering a number from the keyboard 66.

When the score has been entered, a model score 228 is retrieved from a database of model scores 252 and is presented to the scorer 20 (Step 114) to permit him or her to assess the scoring efficacy. In addition, an analysis of the answer is retrieved from a database 253 and is presented (Step 115) on the screen 64 to enable the scorer 20 to review his/her score in light of comments of experienced scorers. In the example of FIG. 7, the analysis covers a student's responses to a number of questions on a reading selection, two of which are included in the high reading model of FIG. 4A. The scorer's score is also stored (Step 116), and a correlation is calculated and presented of that score with the model score (FIG. 8 and Step 117).

In order to refine the skills learned thus far, the scorer 20 will typically choose to practice on further assessments (Step 118), and thus preferably a plurality of responses are available for scoring. As an example, a range of responses representing “low” to “high” models, such as the A-C parts of FIGS. 4 and 5, are available, as well as answers to several different assessments, such as represented in FIGS. 4 and 5, which are responses to reading and writing assignments, respectively.

After entering each score and displaying the model score therefor, the scorer 20 is presented with a cumulative summary table 80 (FIG. 8 and Step 117), which updates and displays the percentage of agreement between the scorer's evaluation and that of an experienced scorer. For example, the scoring status screen of FIG. 8 tabulates for each paper 87 a column for “your score” 81 and a column for a model, or “consensus score” 82. Also presented is a table of “percentage of agreement” 83, including a percentage of “exact agreement” 84 with the model score, a percentage of scores that “differ by 1” 85, and a percentage of scores that “differ by 2 or more” 86. This particular arrangement is not intended to be limiting, as one of skill in the art could imagine any number of similar correlation calculations and modes or presentation. The concept of a summary table is intended to provide an indicator of progress in learning the holistic scoring technique.

If the scorer 20 wishes to end the tutorial session (Step 118), the “Quit” button 216 on the menu 30 may be selected (Step 119).

It may be appreciated by one skilled in the art that additional embodiments may be contemplated, including similar methods and systems for training personnel in scoring open-ended questions for other fields.

In the foregoing description, certain terms have been used for brevity, clarity, and understanding, but no unnecessary limitations are to be implied therefrom beyond the requirements of the prior art, because such words are used for description purposes herein and are intended to be broadly construed. Moreover, the embodiments of the apparatus illustrated and described herein are by way of example, and the scope of the invention is not limited to the exact details of construction.

Having now described the invention, the construction, the operation and use of preferred embodiment thereof, and the advantageous new and useful results obtained thereby, the new and useful constructions, and reasonable mechanical equivalents thereof obvious to those skilled in the art, are set forth in the appended claims. 

What is claimed is:
 1. A computerized method for teaching a scorer to holistically score an answer to an open-ended question, the method comprising the steps of: displaying a model student response to an open-ended question to a scorer; permitting the scorer to access a scoring rubric for the question; receiving a score entered by the scorer; and presenting a model score to the scorer to permit the scorer to assess a scoring efficacy based upon a comparison between the entered score and the model score.
 2. The method recited in claim 1, wherein the displaying step comprises displaying the response in an original handwritten form.
 3. The method recited in claim 1, wherein the displaying step comprises displaying the response in typed text form, wherein the typed text form represents an original handwritten form and retains all original errors.
 4. The method recited in claim 1, wherein the displaying step comprises providing a means for the scorer to select a display mode, the display mode selectable from an original handwritten form and a typed text form.
 5. The method recited in claim 1, further comprising the steps of, preceding the student response displaying step, of electronically displaying a series of images to the user comprising a series of model student answers, corresponding analytical annotations for each answer, and an explanation of a score assigned to each answer, for preparing the user to score a student response.
 6. The method recited in claim 1, further comprising the step of storing the score and a correlation of the score with the model score.
 7. The method recited in claim 1 further comprising the steps of: repeating the student response displaying, rubric access permitting, and score receiving steps for a plurality of student responses; providing a model score for each student response; calculating a summary correlation between the scores and the model scores therefor; storing the scores and the correlation; creating a summary table of the scores and the correlation; and displaying the summary table to the user for providing an indicator of progress in learning the holistic scoring technique.
 8. A method for teaching a scorer to holistically score an answer to an open-ended question, the method comprising the steps of: permitting a scorer to choose a display form from between an original handwritten form and a typed text form, wherein the typed text form represents an original handwritten form and retains all original errors; displaying a student response to an open-ended question to a scorer in the chosen display form; receiving a score entered by the scorer; and presenting a model score to the scorer to permit the scorer to assess a scoring efficacy based upon a comparison between the entered score and the model score.
 9. The method recited in claim 8, further comprising the step of: permitting the scorer to access a model response to the open-ended question for comparison with the student response.
 10. The method recited in claim 9, wherein the model response comprises a plurality of model responses, one at each level of a predetermined succession of performance levels.
 11. The method recited in claim 10, wherein the predetermined succession of performance levels comprises a set of “high,” “medium,” and “low” levels.
 12. A system for teaching a scorer to holistically score an answer to an open-ended question, the system comprising: a computer having resident therein a database of student responses to a plurality of open-ended questions, each student response present in an original handwritten image form and in a text form, the text form retaining all original errors from the handwritten image, the database further containing a scoring rubric for each question; means for displaying a student response to a question to a scorer; means for permitting the scorer to access the scoring rubric for the question; means for receiving a score for the student response from the scorer; and means for presenting on the display means a model score to the scorer to permit the scorer to assess a scoring efficacy based upon a comparison of the model score with the received score.
 13. The system recited in claim 12, wherein the display means and the access-permitting means comprise a workstation having a screen and a keyboard, the workstation in electronic communication with the computer.
 14. A tutorial program for teaching a scorer to holistically score an answer to an open-ended question, the tutorial program comprising: a first database containing a plurality of student responses to an open-ended question, each student response in an original handwritten image form and in a text form, the text form retaining all original errors from the handwritten image; a second database containing a model score for each student response; and a scoring practice module comprising: means for receiving a request from a scorer to view a student response; means for receiving a choice of display form from the scorer; means for retrieving the student response in the chosen display form from the first database and for generating a screen display thereof; means for receiving a score from the scorer; and means for retrieving from the second database and for generating a screen display of a model score for the displayed response.
 15. The tutorial program recited in claim 14, further comprising a third database containing a scoring rubric for each question; and wherein the scoring practice module further comprises: means for receiving a request from the scorer to view a scoring rubric for the displayed response; and means for retrieving the requested scoring rubric from the third database and for generating a screen display thereof.
 16. The tutorial program recited in claim 14, further comprising a fourth database containing an analysis of each student response and a rationale for the model score for each student response; and wherein the scoring practice module further comprises means for retrieving and generating a screen display of the analysis and rationale for the displayed student response.
 17. The tutorial program recited in claim 14, wherein the scoring practice module further comprises means for calculating a correlation between the entered score and the model score.
 18. The tutorial program recited in claim 17, wherein: the means for receiving a request from a scorer to view a student response comprises means for receiving a succession of requests from the scorer to view a series of student responses; the score-receiving means comprises means for receiving a score for each student response; and the correlation calculating means comprises means for calculating a summary correlation between the received scores and the corresponding model scores.
 19. The tutorial program recited in claim 18, wherein the scoring module further comprises means for displaying the calculated summary correlation to the scorer.
 20. The tutorial program recited in claim 19, wherein the summary correlation comprises a chart comprising a list of percentage of agreement values.
 21. The tutorial program recited in claim 20, wherein the list comprises a first entry comprising a percentage of exact agreement with the model answers and a second entry comprising a percentage of entered scores that differed from the model answer by a predetermined integral value. 